Simultaneous Estimation of Dimension, States and Measurements: Gram estimations 






Cyril Stark

Institute for Theoretical Physics, ETH Zurich, 8093 Zurich, Switzerland

We study a systematic procedure to find effective quantum models describing measurement data. 
Experiments (e.g., involving superconducting qubits) have shown that we do not always have a 
good understanding of how to model the measurements with positive operator valued measures 
(POVMs). It turns out that the ad hoc postulation of POVMs can lead to inconsistencies. For
example, when doing asymptotic state tomography via linear inversion, one sometimes ends up with
density matrices that are not positive semidefinite. We propose an alternative procedure where we
do not make any a priori assumptions on the quantum model, i.e., on the Hilbert space dimension, 
the states or the POVMs. In this paper, we take the first steps along this program by estimating 
the Gram matrix associated with the states and the measurements. The Gram matrix specifies the 
Hilbert space dimension and determines all the states and the POVM elements, up to simultaneous 
rotations in the space of Hermitian matrices. We are guided by Occam's razor, i.e., we search for 
the minimal quantum model consistent with the data. In an upcoming paper we will show how the 
explicit valid density matrices and POVM elements can be found, using a heuristic algorithm that 
takes the state-measurement Gram matrix as input. 



Roughly speaking, the goal of statistics is the infer- 
ence of probability measures associated with random ex- 
periments. Probability theory then takes these proba- 
bility measures as an input to make predictions about 
the measurement data of future experiments. Thus, the 
practical use of probability theory relies on statistics. 
The same holds true in the quantum mechanical set- 
ting. The kinematics of quantum mechanics is a sta- 
tistical model. It comes along with a number of free 
parameters: Hilbert space dimension, density matrices 
associated with the states and POVM elements describ- 
ing the measurements. To make predictions about future 
measurement outcomes, or to analyze dynamics (process 
tomography), these degrees of freedom need to be spec- 
ified. Commonly, to do so, one starts by postulating 
a Hilbert space dimension d and assumes that one ex- 
plicitly knows a set of measurements (state tomography) 
or a set of states (measurement tomography). In both 
scenarios we have to make assumptions, either on the 
involved states or the performed measurements. This 
is not only unsatisfactory from a theoretical viewpoint 
but also from a practical perspective. For instance, it 
turns out that linear inversion in the quasi-asymptotic 
regime [17] sometimes yields matrices that are not positive
semidefinite [1] and therefore, they are not always
proper quantum states or measurements. Typically, this 
issue is merely circumvented: the matrix reported is the 
one that is in some sense closest to the result obtained 
by linear inversion, while still in the cone of positive 
semidefinite matrices. However, the observation that the 
matrices so computed are not valid quantum mechani- 
cal operators simply shows that our initial assumptions 
(about the dimension, states or measurements) were in- 
accurate. Had they been correct, linear inversion would 
have generated positive semidefinite matrices. We attempt
to resolve these problems by the development of a
transparent strategy to simultaneously and consistently
estimate the Hilbert space dimension, the states and the
measurements.

In this paper, we describe how the Gram matrix asso- 
ciated with the involved density matrices and POVM el- 
ements can be estimated. This Gram matrix determines 
the minimal Hilbert space dimension, the states, and the 
measurements up to simultaneous rotations (with respect 
to the Hilbert-Schmidt inner product) of all the involved 
density matrices and POVM elements. Thus, it fixes all 
the relative geometric relationships between elements of 
the set of states and measurements. In an upcoming paper we will show
how the explicit density matrices and POVM elements 
can be found, using a heuristic algorithm that takes the 
state-measurement Gram matrix as input. 

It is common to assign deviations between the pos- 
tulated states (measurements) and the states prepared 
(measurements performed) to the class of 'systematic er- 
rors'. In the authors describe two consistency tests to 
falsify assumptions about the experimental setup. Tem- 
poral drifts in the experimental setup have been analyzed 
in [ij . Other papers focused on implications of erroneous 
assumptions about the theoretical description of the per- 
formed experiments: pseudo violation of Bell inequali- 
ties [j| and false entanglement witnessing A recent 
exposition of systematic errors can be found in 0] . There, 
the authors moreover describe methods to modify entan- 
glement witnesses to deal with systematic errors. Other 
works aim at estimating the Hilbert space dimension via 
linear witnesses ||. 

We begin by formulating the precise problem, and show 
that up to simultaneous rotations of the states and the 
POVM elements, it can be reduced to finding the ap- 
propriate state-measurement Gram matrix. We describe 
how this Gram matrix can be found via convex relaxation 
of the original problem, and provide numerical examples. 






SETUP 



THE SIMPLEST QUANTUM MECHANICAL 
MODEL 



In what follows, experiments are highly abstracted. From the data analysis perspective an
experiment is a black box with two knobs: $\mathcal{K}_{\rm st}$ to choose the state to be
prepared, and $\mathcal{K}_{\rm m}$ to specify the measurement to be performed. The knobs
$\mathcal{K}_{\rm st}$ and $\mathcal{K}_{\rm m}$ admit adjustments $w \in \{1,\dots,W\}$ and
$v \in \{1,\dots,V\}$ respectively. To execute the experiment, the experimentalist chooses
specific positions $(w,v)$ for $\mathcal{K}_{\rm st}$ and $\mathcal{K}_{\rm m}$. Subsequently,
the black box provides the experimentalist with one out of $K_v$ possible outcomes:
$k \in \{1,\dots,K_v\}$. Of course, the experimentalist typically sees a quasi-continuous signal
and not a set of discrete outcomes. Here, however, we regard the mapping of the directly observed
quasi-continuous signal onto the set of discrete outcomes $\{1,\dots,K_v\}$ as being part of the
black box. In the following, we are going to write $K$ instead of $K_v$, thus suppressing the
dependence of $K$ on $v$. Rerunning the experiment with the same adjustments $(w,v)$ can lead to
a different outcome $k' \neq k$. Repeating the experiment $N$ times, we can count how many times
we obtained outcome "1", outcome "2", ..., outcome "K". Dividing these numbers by the total
number of experiments $N_{w,v}$, we obtain frequencies $(f_{w,(v,1)},\dots,f_{w,(v,K)})$ for the
outcomes "1", ..., "K". Rerunning the experiment for all possible choices for $(w,v)$, we arrive
at the following data table:



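$$
\mathcal{D} =
\begin{pmatrix}
f_{1,(1,1)} & \cdots & f_{1,(1,K)} & \cdots & f_{1,(V,1)} & \cdots & f_{1,(V,K)} \\
\vdots      &        & \vdots      &        & \vdots      &        & \vdots      \\
f_{W,(1,1)} & \cdots & f_{W,(1,K)} & \cdots & f_{W,(V,1)} & \cdots & f_{W,(V,K)}
\end{pmatrix}
$$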



Thus, the row index of the data table $\mathcal{D}$ enumerates the adjustments of the knob
$\mathcal{K}_{\rm st}$ while the column index of $\mathcal{D}$ enumerates the adjustments of the
knob $\mathcal{K}_{\rm m}$ in combination with the associated measurement outcomes. In the
following we are assuming the asymptotic limit $N_{w,v} \to \infty$ for all choices $(w,v)$.
This allows us to identify the entries of the data table $\mathcal{D}$ with probabilities:
given that the knob positions are $(w,v)$, the probability $p_{w,(v,k)}$ for measuring outcome
"k" equals $f_{w,(v,k)}$. From the viewpoint of quantum mechanics, these probabilities are given
via Born's rule:

$$ f_{w,(v,k)} = p_{w,(v,k)} = \mathrm{tr}(\rho_w E_{vk}). \tag{1} $$

Here, $\rho_w$ is the density matrix that corresponds to adjustment 'w' of the knob
$\mathcal{K}_{\rm st}$ and $(E_{vk})_k$ is the POVM corresponding to the adjustment 'v' of the
knob $\mathcal{K}_{\rm m}$. In the remainder we are assuming that the states and POVMs
are pairwise independent.
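As an illustration of Born's rule, Eq. (1), the following minimal sketch assembles a data table from randomly drawn pure states and projective measurements. The helper names (`random_pure_state`, `random_projective_povm`, `data_table`) and the choice $d = 2$, $W = V = 5$ are assumptions made for this example only; the column ordering (first $v$, then $k$) follows the layout of the data table above.

```python
import numpy as np

def random_pure_state(d, rng):
    """Density matrix |psi><psi| of a normalized Gaussian random vector."""
    psi = rng.normal(size=d) + 1j * rng.normal(size=d)
    psi /= np.linalg.norm(psi)
    return np.outer(psi, psi.conj())

def random_projective_povm(d, rng):
    """Rank-1 projectors onto the columns of a random unitary (QR of a Gaussian matrix)."""
    q, _ = np.linalg.qr(rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d)))
    return [np.outer(q[:, k], q[:, k].conj()) for k in range(d)]

def data_table(states, povms):
    """D[w, (v, k)] = tr(rho_w E_vk); columns are ordered by v first, then k."""
    elements = [E for povm in povms for E in povm]
    return np.array([[np.trace(rho @ E).real for E in elements] for rho in states])

rng = np.random.default_rng(0)
d, W, V = 2, 5, 5                      # illustrative sizes, not taken from the text
states = [random_pure_state(d, rng) for _ in range(W)]
povms = [random_projective_povm(d, rng) for _ in range(V)]
D = data_table(states, povms)          # shape (W, V * d)
```

Each block of $K$ consecutive columns belongs to one measurement, so the entries in such a block sum to one in every row.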



We aim at finding the simplest quantum mechanical model that is compatible with the data
$\mathcal{D}$. By 'simplest' we mean that the number of degrees of freedom in the description
must be as small as possible (Occam's razor). The number of degrees of freedom in the quantum
model is determined by the dimension $d$ of its Hilbert space because the number of states, the
number of measurements and the number of outcomes are fixed by the data table $\mathcal{D}$.
Hence, we say that a model A is simpler than a model B if the Hilbert space dimension of A is
smaller than the Hilbert space dimension of B. Let $d$ be the unknown Hilbert space dimension.
The density matrices $\rho_w$ and the POVM elements $E_{vk}$ are matrices in
$\mathbb{C}^{d \times d}$. Let $\mathcal{B}$ denote a Hilbert-Schmidt orthonormal basis in the
space of Hermitian matrices in $\mathbb{C}^{d \times d}$. With respect to $\mathcal{B}$ we can
express all density matrices and POVM elements in terms of vectors
$\vec{\rho}_w, \vec{E}_{vk} \in \mathbb{R}^{d^2}$ because the Hermitian matrices on
$\mathbb{C}^d$ form a real $d^2$-dimensional vector space. Due to the orthonormality of
$\mathcal{B}$,



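$$ \mathrm{tr}(\rho_w E_{vk}) = \vec{\rho}_w^{\,T} \vec{E}_{vk}. \tag{2} $$

Define

$$ P = \big( \vec{\rho}_1 \,|\, \cdots \,|\, \vec{\rho}_W \,|\, \vec{E}_{11} \,|\, \cdots \,|\, \vec{E}_{1K} \,|\, \cdots \,|\, \vec{E}_{V1} \,|\, \cdots \,|\, \vec{E}_{VK} \big) \tag{3} $$

and

$$ G = P^T P. \tag{4} $$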



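The matrix $G$ is the Gram matrix associated with $\vec{\rho}_1, \dots, \vec{E}_{VK}$. The data
table $\mathcal{D}$ appears as off-diagonal block in the states-measurement Gram matrix:

$$ G = \begin{pmatrix} G_{\rm st} & \mathcal{D} \\ \mathcal{D}^T & G_{\rm m} \end{pmatrix}. \tag{5} $$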



Note that $Gv = 0$ implies $v^T G v = (Pv)^T (Pv) = \|Pv\|_2^2 = 0$, leading to $Pv = 0$. On the
other hand, $Pv = 0$ implies $Gv = P^T P v = 0$. Therefore, $G$ and $P$ have identical null
spaces, and consequently,

$$ \mathrm{rank}(G) = \mathrm{rank}(P). \tag{6} $$

Thus, the minimal Hilbert space dimension $d$ satisfies

$$ d = \min\{ n \in \mathbb{N} \mid n^2 \geq \mathrm{rank}(G) \}. \tag{7} $$
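The chain from Hermitian matrices to vectors, to the Gram matrix, and finally to the minimal dimension in Eq. (7) can be sketched numerically. The function names below (`herm_to_vec`, `numerical_rank`, `minimal_dimension`), the particular orthonormal basis (diagonal entries plus scaled real and imaginary off-diagonal parts) and the qubit example are assumptions chosen for illustration; any Hilbert-Schmidt orthonormal basis would serve.

```python
import numpy as np

def herm_to_vec(A):
    """Coordinates of a Hermitian matrix in a Hilbert-Schmidt orthonormal basis,
    so that tr(A B) equals the dot product of the two coordinate vectors."""
    iu = np.triu_indices(A.shape[0], k=1)
    return np.concatenate([A.diagonal().real,
                           np.sqrt(2) * A[iu].real,
                           np.sqrt(2) * A[iu].imag])

def numerical_rank(M, tol=1e-9):
    """Number of singular values above an absolute tolerance (an assumed threshold)."""
    return int(np.sum(np.linalg.svd(M, compute_uv=False) > tol))

def minimal_dimension(rank_G):
    """Smallest natural number n with n**2 >= rank_G, cf. Eq. (7)."""
    n = 1
    while n * n < rank_G:
        n += 1
    return n

def proj(v):
    v = np.asarray(v, dtype=complex)
    v = v / np.linalg.norm(v)
    return np.outer(v, v.conj())

# Qubit example: four states and three projective measurements (Z, X, Y bases).
states = [proj([1, 0]), proj([0, 1]), proj([1, 1]), proj([1, 1j])]
elements = [proj([1, 0]), proj([0, 1]), proj([1, 1]),
            proj([1, -1]), proj([1, 1j]), proj([1, -1j])]

P = np.column_stack([herm_to_vec(M) for M in states + elements])  # shape (4, 10)
G = P.T @ P                                                       # Gram matrix, Eq. (4)
d_min = minimal_dimension(numerical_rank(G))
```

In this example `G[:4, 4:]` is the data table of the ten operators, and both `G` and `P` have rank 4, so `d_min` evaluates to 2.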

We conclude that finding the minimal quantum model with dimension $d$ is equivalent to finding a
Gram matrix $G$ of minimal rank which is compatible (in the sense of Eq. (5)) with the data
$\mathcal{D}$. More precisely, the Gram matrix $G$ associated with the simplest quantum model is
a solution of the optimization problem [18]

$$
\begin{aligned}
&\operatorname{argmin} && \mathrm{rank}\, G \\
&\text{subject to} && G \in \mathcal{G}_{\rm QM}, \\
& && G_{i,W+j} = \mathcal{D}_{ij}.
\end{aligned}
\tag{8}
$$



Here and in the remainder we are assuming $i = 1,\dots,W$ and $j = 1,\dots,VK$ to keep the
exposition as simple as possible. But in situations where not all $\mathcal{D}$-entries are
known, we are free to loosen the constraints in Eq. (8) accordingly. In Eq. (8), we used
$\mathcal{G}_{\rm QM}$ to denote the set of Gram matrices that can be generated via density
matrices and POVM elements. Thus, the set $\mathcal{G}_{\rm QM}$ is contained in the set $S^+$
of all real-valued, positive semidefinite matrices. When trying to solve (8) we are facing a
major difficulty: the optimization problem (8) is not convex (thus leading to local minima [19])
because the rank of a matrix is not a convex function. For example, for $0 < p < 1$,

$$
2 = \mathrm{rank}\big( p\,|0\rangle\langle 0| + (1-p)\,|1\rangle\langle 1| \big)
  > p\,\mathrm{rank}(|0\rangle\langle 0|) + (1-p)\,\mathrm{rank}(|1\rangle\langle 1|) = 1. \tag{9}
$$

Moreover, we do not know how to characterize $\mathcal{G}_{\rm QM}$. Below we show how these
issues can be approximately resolved by replacing the rank minimization problem by a closely
related convex optimization problem that can be solved efficiently. Before we continue, we note
that $\sqrt{\mathrm{rank}(\mathcal{D})}$, which can be computed from the data, provides via
Eq. (7) a lower bound for $d$,

$$ \sqrt{\mathrm{rank}(\mathcal{D})} \leq d, \tag{10} $$

because $\mathrm{rank}(\mathcal{D}) \leq \mathrm{rank}(G)$.
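Both observations above, the failure of convexity in Eq. (9) and the data-only lower bound in Eq. (10), are easy to check numerically. The sketch below uses a hand-written qubit data table (rows: the states $|0\rangle$, $|1\rangle$, $|+\rangle$, $|{+i}\rangle$; columns: projective measurements in the Z, X and Y bases); the table, the function names and the tolerance are assumptions for illustration.

```python
import numpy as np

def numerical_rank(M, tol=1e-9):
    """Number of singular values above an absolute tolerance (an assumed threshold)."""
    return int(np.sum(np.linalg.svd(M, compute_uv=False) > tol))

def dimension_lower_bound(D, tol=1e-9):
    """Smallest integer n with n**2 >= rank(D), i.e. the bound of Eq. (10) rounded up."""
    r = numerical_rank(D, tol)
    n = 0
    while n * n < r:
        n += 1
    return n

# Eq. (9): the rank of a mixture exceeds the mixture of the ranks.
p = 0.5
X0, X1 = np.diag([1.0, 0.0]), np.diag([0.0, 1.0])
rank_of_mixture = numerical_rank(p * X0 + (1 - p) * X1)
mixture_of_ranks = p * numerical_rank(X0) + (1 - p) * numerical_rank(X1)

# Eq. (10): lower bound on d computed from the data table alone.
D = np.array([[1.0, 0.0, 0.5, 0.5, 0.5, 0.5],
              [0.0, 1.0, 0.5, 0.5, 0.5, 0.5],
              [0.5, 0.5, 1.0, 0.0, 0.5, 0.5],
              [0.5, 0.5, 0.5, 0.5, 1.0, 0.0]])
d_lower = dimension_lower_bound(D)
```

For this table the rank is 4, so the bound already identifies the qubit dimension.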

COMPUTING THE GRAM MATRIX VIA 
CONVEX RELAXATION 

Rank minimization problems are not convex. Thus, the problem (8) in its general form suffers
from local minima. A practical approach to solve non-convex optimization problems is to relax
them to the closest convex optimization problem. This is illustrated in Fig. 1: instead of
trying to find the global minimum of the non-convex function $f(x)$, we are computing the
minimum of the function $g(x)$. Then, the convexity of $g(x)$ guarantees that the found minimum
is a global minimum of $g(x)$. A function $g : C \to \mathbb{R}$ from a convex set $C$ to
$\mathbb{R}$ is the convex envelope of a function $f : C \to \mathbb{R}$ if it is the pointwise
largest convex function satisfying $g(x) \leq f(x)$ for all $x \in C$. Note that this property
depends on the convex set $C$. This becomes evident when we replace the choice $C = \mathbb{R}$
in Fig. 1 by an interval, e.g., by $C = [-1,1]$. Recall that
$\mathcal{G}_{\rm QM} \subset S^+$. Our goal is to approximately solve the optimization
problem (8) by replacing the rank function by its convex envelope with respect to the convex set

$$ C := \{ X \in \mathbb{R}^{n \times n} \mid X \geq 0,\ \|X\| \leq M_{\rm QM} \}, \tag{11} $$

($\|\cdot\|$ denotes the operator norm) where

$$ M_{\rm QM} := \sup\{ \|X\| \mid X \in \mathcal{G}_{\rm QM} \}, \tag{12} $$



i.e., $M_{\rm QM}$ is the radius of the smallest operator norm ball containing the set of
quantum Gram matrices $\mathcal{G}_{\rm QM}$. In [§], Fazel, Hindi and Boyd proved that
$\|X\|_1 / M_{\rm QM}$ is the convex envelope of the rank function on the larger set

$$ \tilde{C} := \{ X \in \mathbb{R}^{n \times n} \mid \|X\| \leq M_{\rm QM} \} \supset C. \tag{13} $$

Here, $\|\cdot\|_1$ denotes the trace norm. Since $C$ is a proper subset of the base set
$\tilde{C}$, it is unclear whether or not $\|X\|_1 / M_{\rm QM}$ is also the convex envelope of
the rank function with respect to $C$ (see Fig. 1). That this is indeed the case will be proven
later by a straightforward adaptation of the derivation given in [§]. Note that on $C$,
$\|\cdot\|_1 = \mathrm{tr}(\cdot)$. This motivates replacing the non-convex optimization
problem (8) by the convex optimization problem

$$
\begin{aligned}
&\operatorname{argmin} && \mathrm{tr}\, G \\
&\text{subject to} && G \geq 0,\ \|G\| \leq M_{\rm QM}.
\end{aligned}
\tag{14}
$$

The optimization problem (14) can even be cast into a semidefinite program: the constraint
$\|G\| \leq M_{\rm QM}$ in Eq. (14) is equivalent to $M_{\rm QM} I - G \geq 0$ because
$G \geq 0$. Set

$$ Z = M_{\rm QM} I - G = (W + Vd) I - G. \tag{15} $$

Then, the optimization problem (14) is equivalent to the semidefinite program

$$
\begin{aligned}
&\operatorname{argmin} && \mathrm{tr}\, G \\
&\text{subject to} && \mathrm{diag}(G, Z) \geq 0,
\end{aligned}
\tag{16}
$$

because

$$ Z \geq 0,\ G + Z = M_{\rm QM} I \quad \Leftrightarrow \quad M_{\rm QM} I - G \geq 0. \tag{17} $$

As a consequence of being an instance of semidefinite programming, the optimization
problem (16) can be solved
efficiently by standard methods [13, 1 1 1| - 
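Two facts used in the reformulation above can be checked numerically without a semidefinite solver: on positive semidefinite matrices the trace norm coincides with the trace, and the operator norm bound on $G$ is equivalent to positivity of $Z = M I - G$ and hence of the block matrix $\mathrm{diag}(G, Z)$. The sketch below is an assumption-laden illustration: the matrix `G` is a random positive semidefinite matrix rather than a genuine quantum Gram matrix, and the bound `M` is a hypothetical value chosen relative to its operator norm.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 6
A = rng.normal(size=(n, 3))
G = A @ A.T                                    # random PSD matrix of rank 3 (stand-in for a Gram matrix)

op_norm = np.linalg.norm(G, 2)                 # operator norm = largest singular value
trace_norm = np.linalg.svd(G, compute_uv=False).sum()

# Feasible case: hypothetical bound M above the operator norm.
M = 1.5 * op_norm
Z = M * np.eye(n) - G
block = np.block([[G, np.zeros((n, n))],
                  [np.zeros((n, n)), Z]])      # diag(G, Z) as in the semidefinite program
min_eig_block = np.linalg.eigvalsh(block).min()

# Infeasible case: a bound below the operator norm makes Z indefinite.
Z_bad = 0.5 * op_norm * np.eye(n) - G
min_eig_bad = np.linalg.eigvalsh(Z_bad).min()
```

In the feasible case the smallest eigenvalue of the block matrix is non-negative up to rounding, while in the infeasible case `Z_bad` has a strictly negative eigenvalue.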

COMPUTING THE BOUND $M_{\rm QM}$

The purpose of this section is the computation of the upper bound $M_{\rm QM}$. We start with

$$ \|G\| \leq \|G\|_2, \tag{18} $$

where $\|G\|_2^2 = \mathrm{tr}(G^* G)$, which holds true for general matrices. The bound (18) is
tight for $G \in \mathcal{G}_{\rm QM}$ because for every Hilbert space dimension $d$ and for
every rank of $G$, we can choose the vectors $\vec{\rho}_1, \dots, \vec{E}_{VK}$ (the columns
of $P$; cf. Eq. (3)) such that they are almost parallel to the identity $\mathbb{I}$. Thus, for
any choice of $\mathrm{rank}(G) = \mathrm{rank}(P)$, $G$ can become arbitrarily close to a
positive semidefinite rank-1 matrix (which corresponds to all columns of $P$ being parallel).
Thus, the vector of eigenvalues of $G$ becomes arbitrarily close to the vector
$(\|G\|, 0, \dots, 0)^T$, i.e., $\|G\|$ becomes arbitrarily close to $\|G\|_2$. Consequently,
the upper bound (18) is tight. We continue by observing that

$$
\|G\|^2 \leq \|G\|_2^2 = \|P^T P\|_2^2 \leq \|P\|_2^4
= \Big( \sum_{j=1}^{W+VK} \|\vec{P}_j\|_2^2 \Big)^2
= \Big( \sum_{w=1}^{W} \|\rho_w\|_2^2 + \sum_{v=1}^{V} \sum_{k=1}^{K} \|E_{vk}\|_2^2 \Big)^2.
\tag{19}
$$

In the second inequality, we have used the sub-multiplicativity of the Hilbert-Schmidt norm.
The Hilbert-Schmidt norm of quantum states is lower bounded by the norm of the maximally mixed
state and upper bounded by the norm of pure states. Consequently,

$$ \|\rho_j\|_2 \in [1/\sqrt{d}, 1] \quad \Rightarrow \quad \|\rho_j\|_2^2 \leq 1. \tag{20} $$

The condition $\sum_k E_{vk} = \mathbb{I}$ implies

$$ d = \sum_k \|E_{vk}\|_2^2 + \sum_{k \neq q} \mathrm{tr}(E_{vk} E_{vq}) \tag{21} $$

and therefore,

$$ \sum_k \|E_{vk}\|_2^2 \leq d \tag{22} $$

because $\mathrm{tr}(MN) \geq 0$ whenever $M, N \geq 0$. This upper bound is tight because it is
achieved by projective, non-degenerate measurements. Using Eq. (22) and Eq. (20) in Eq. (19), we
arrive at

$$ M_{\rm QM} = W + Vd. \tag{23} $$
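The bound $M_{\rm QM} = W + Vd$ can be checked on sampled configurations. The sketch below draws random pure states and random projective, non-degenerate measurements, maps them to real vectors and compares the operator norm of the resulting Gram matrix with $W + Vd$. All helper names, the basis used in `herm_to_vec` and the sizes $d = 3$, $W = 4$, $V = 3$ are assumptions for illustration.

```python
import numpy as np

def herm_to_vec(A):
    """Coordinates of a Hermitian matrix in a Hilbert-Schmidt orthonormal basis."""
    iu = np.triu_indices(A.shape[0], k=1)
    return np.concatenate([A.diagonal().real,
                           np.sqrt(2) * A[iu].real,
                           np.sqrt(2) * A[iu].imag])

def random_unitary(d, rng):
    """Unitary from the QR decomposition of a complex Gaussian matrix."""
    q, _ = np.linalg.qr(rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d)))
    return q

rng = np.random.default_rng(2)
d, W, V = 3, 4, 3                                   # illustrative sizes

columns = []
for _ in range(W):                                  # pure states: first column of a random unitary
    psi = random_unitary(d, rng)[:, 0]
    columns.append(herm_to_vec(np.outer(psi, psi.conj())))
for _ in range(V):                                  # projective, non-degenerate measurements
    U = random_unitary(d, rng)
    for k in range(d):
        columns.append(herm_to_vec(np.outer(U[:, k], U[:, k].conj())))

P = np.column_stack(columns)                        # shape (d**2, W + V*d)
G = P.T @ P
op_norm = np.linalg.norm(G, 2)
bound = W + V * d                                   # Eq. (23)
squared_hs_norm_of_P = np.linalg.norm(P, 'fro') ** 2
```

For pure states and rank-1 projectors every column of `P` has unit norm, so the squared Hilbert-Schmidt norm of `P` equals $W + Vd$ exactly, and the operator norm of `G` stays below it.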



FINDING THE CONVEX RELAXATION WITH
RESPECT TO $S^+ \cap B_{\|\cdot\| \leq 1}$

In [§], Fazel, Hindi and Boyd proved that $\|\cdot\|_1$ is the convex envelope of the rank
function on the set of matrices $X$ with $\|X\| \leq 1$. In the following we present a
modification of their argument to show that the trace (and hence still $\|\cdot\|_1$) is the
convex envelope of the rank function when restricting the above ball of matrices
$\|X\| \leq 1$ to its intersection with the cone of positive semidefinite matrices, i.e.,
$X \in S^+ \cap B_{\|\cdot\| \leq 1}$.

FIG. 1: (Color online) The function g(x) is the convex envelope of f(x), i.e., it is the
largest convex function that pointwise lower bounds f(x).

Recall that for an arbitrary function $f : C \to \mathbb{R}$, $C$ convex,

$$ f^*(y) = \sup\{ \langle y, x \rangle - f(x) \mid x \in C \} $$

is its conjugate. The convex envelope of the rank function with respect to the convex set

$$ C := \{ X \in \mathbb{R}^{n \times n} \mid X \geq 0,\ \|X\| \leq 1 \} $$

is $\mathrm{rank}^{**}$, i.e., the double-conjugate with respect to $C$;
see [lH. Observe that

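$$
\mathrm{rank}^*(Y) = \sup_{X \in C} \{ \mathrm{tr}(YX) - \mathrm{rank}(X) \}
= \max\Big\{ \sup_{\substack{X \in C, \\ \mathrm{rank}(X) = 1}} \{ \mathrm{tr}(YX) - 1 \}, \dots,
             \sup_{\substack{X \in C, \\ \mathrm{rank}(X) = n}} \{ \mathrm{tr}(YX) - n \} \Big\}.
\tag{24}
$$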


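Here, $Y$ is an arbitrary Hermitian $(n \times n)$ matrix (recall that the Hermitian matrices
form the vector space carrying $S^+$). Due to their Hermiticity, both $X$ and $Y$ can be
diagonalized orthogonally,

$$
X = \sum_{j=1}^{n} \varepsilon(X)_j \, |e(X)_j\rangle\langle e(X)_j|, \qquad
Y = \sum_{j=1}^{n} \varepsilon(Y)_j \, |e(Y)_j\rangle\langle e(Y)_j|. \tag{25}
$$

In the remainder we are assuming that all the eigenvalues are sorted descendingly. We observe
that

$$
\mathrm{tr}(YX) = \sum_{i=1}^{n} \varepsilon(Y)_i \Big( \sum_{j=1}^{n}
  |\langle e(Y)_i | e(X)_j \rangle|^2 \, \varepsilon(X)_j \Big)
= \varepsilon(Y)^T Q \, \varepsilon(X), \tag{26}
$$

where $Q$ is the doubly stochastic matrix $Q_{ij} = |\langle e(Y)_i | e(X)_j \rangle|^2$. Let
$s$ be such that $\varepsilon(Y)_j \geq 0$ for $j \leq s$ and $\varepsilon(Y)_j < 0$ for
$j > s$. Consider a term "m", $m \leq s$, from Eq. (24), i.e.,

$$ \sup_{\substack{X \in C, \\ \mathrm{rank}(X) = m}} \{ \varepsilon(Y)^T Q \, \varepsilon(X) - m \}. $$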



We claim that

$$
\varepsilon(Y)^T Q \, \varepsilon(X) \leq
\varepsilon(Y)^T (\underbrace{1, \dots, 1}_{m\ \text{times}}, 0, \dots, 0)^T,
\quad \forall Q, \varepsilon(X), \tag{27}
$$

is a tight upper bound. Consider

$$
\begin{aligned}
&\text{maximize} && \varepsilon(Y)^T Q \, \varepsilon(X) \\
&\text{subject to} && Q \ \text{doubly stochastic}.
\end{aligned}
\tag{28}
$$

The optimization problem (28) is linear. It follows that the optimum is achieved at an extremal
point. The doubly stochastic matrices form a polytope whose vertices are the permutation
matrices (Birkhoff-von Neumann theorem). Hence, a solution $Q$ to (28) is a permutation matrix.
An optimal choice is $Q = I$ because $\varepsilon(X)$ and $\varepsilon(Y)$ are ordered
descendingly, and

$$ \langle x^{\downarrow}, y \rangle \leq \langle x^{\downarrow}, y^{\downarrow} \rangle $$

for arbitrary vectors $x, y \in \mathbb{R}^n$ (see Corollary II.4.4 in [HI). Consequently,
$Q = I$, e.g., via

$$ |e(X)_j\rangle := |e(Y)_j\rangle, \quad \forall j, $$

solves (28) independently of the specific values of $\varepsilon(X)$ and $\varepsilon(Y)$. To
conclude the proof that Eq. (27) describes a tight upper bound, we have to solve



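$$
\begin{aligned}
&\text{maximize} && \varepsilon(Y)^T \varepsilon(X) \\
&\text{subject to} && X \geq 0,\ \|X\| \leq 1,\ \mathrm{rank}(X) = m.
\end{aligned}
\tag{29}
$$

The constraints imply

$$ (0, \dots, 0)^T \leq \varepsilon(X) \leq (\underbrace{1, \dots, 1}_{m\ \text{times}}, 0, \dots, 0)^T $$

(componentwise). As $m \leq s$, the l.h.s. of Eq. (27) becomes maximal for the componentwise
maximum of $\varepsilon(X)$, i.e., for

$$ \varepsilon(X) = (\underbrace{1, \dots, 1}_{m\ \text{times}}, 0, \dots, 0)^T. \tag{30} $$

This proves that the upper bound in Eq. (27) is correct and tight. In case of $m > s$, non-zero
choices of $\varepsilon(X)_j$, $s < j \leq m$, lead to negative contributions to the l.h.s. of
Eq. (27). Hence, in case of $m > s$, the choice

$$ \varepsilon(X) = (\underbrace{1, \dots, 1}_{s\ \text{times}}, 0, \dots, 0)^T \tag{31} $$

realizes the tight upper bound. Combining Eq. (30) and Eq. (31), we arrive at

$$
\sup_{\substack{X \in C, \\ \mathrm{rank}(X) = m}} \{ \varepsilon(Y)^T Q \, \varepsilon(X) \} - m =
\begin{cases}
\sum_{j=1}^{m} (\varepsilon(Y)_j - 1), & \text{for } m \leq s, \\
-(m - s) + \sum_{j=1}^{s} (\varepsilon(Y)_j - 1), & \text{for } m > s.
\end{cases}
\tag{32}
$$

To choose the optimal $m$ (recall Eq. (24)), we note that $m \mapsto m + 1$ is profitable as
long as $\varepsilon(Y)_{m+1} - 1 > 0$. Using the compact notation $a_+ = \max\{a, 0\}$, we
conclude

$$ \mathrm{rank}^*(Y) = \sum_{j=1}^{n} (\varepsilon(Y)_j - 1)_+. \tag{33} $$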
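The closed form in Eq. (33) can be probed numerically. The sketch below evaluates $\sum_j (\varepsilon(Y)_j - 1)_+$ for a random symmetric $Y$, checks that the projector onto the eigenvectors of $Y$ with eigenvalue larger than one attains this value, and verifies that randomly sampled feasible matrices $X$ (positive semidefinite with operator norm at most one) never exceed it. The function names, the sampling scheme and the matrix size are assumptions for illustration.

```python
import numpy as np

def rank_conjugate(Y):
    """Closed form of Eq. (33): sum of the positive parts of (eigenvalue - 1)."""
    eps = np.linalg.eigvalsh(Y)
    return float(np.sum(np.maximum(eps - 1.0, 0.0)))

def maximizer(Y):
    """Projector onto the eigenvectors of Y whose eigenvalue exceeds 1."""
    eps, vecs = np.linalg.eigh(Y)
    sel = vecs[:, eps > 1.0]
    return sel @ sel.conj().T

def objective(Y, X):
    """tr(Y X) - rank(X), the quantity maximized in Eq. (24)."""
    return float(np.trace(Y @ X).real) - int(np.linalg.matrix_rank(X))

rng = np.random.default_rng(3)
n = 5
B = rng.normal(size=(n, n))
Y = 2.0 * (B + B.T)                       # random real symmetric test matrix

attained = objective(Y, maximizer(Y))
bound = rank_conjugate(Y)

sampled = []
for r in range(1, n + 1):                 # random feasible X of every rank r
    A = rng.normal(size=(n, r))
    X = A @ A.T
    X /= np.linalg.norm(X, 2) * (1.0 + rng.random())   # PSD with operator norm <= 1
    sampled.append(objective(Y, X))
```

The values in `sampled` stay below `bound`, while `attained` coincides with it up to rounding.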



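To determine $\mathrm{rank}^{**}(Z)$, we can follow the Fazel-Hindi-Boyd arguments [§]. We
repeat them for the reader's convenience:

$$ \mathrm{rank}^{**}(Z) = \sup_{Y = Y^T} \{ \mathrm{tr}(ZY) - \mathrm{rank}^*(Y) \} \tag{34} $$

for all $Z \geq 0$ and $\|Z\| \leq 1$. Define

$$ \Omega := \mathrm{tr}(ZY) - \mathrm{rank}^*(Y). \tag{35} $$

We consider the two cases $\|Y\| \leq 1$ and $\|Y\| > 1$,

$$
\mathrm{rank}^{**}(Z) = \max\Big\{ \sup_{\substack{Y = Y^T, \\ \|Y\| \leq 1}} \Omega,\
                                   \sup_{\substack{Y = Y^T, \\ \|Y\| > 1}} \Omega \Big\}. \tag{36}
$$

Assume $\|Y\| \leq 1$. Then, as a consequence of Eq. (33), $\mathrm{rank}^*(Y) = 0$, and
therefore,

$$
\sup_{\substack{Y = Y^T, \\ \|Y\| \leq 1}} \Omega
= \sup_{\substack{Y = Y^T, \\ \|Y\| \leq 1}} \{ \mathrm{tr}(ZY) \}. \tag{37}
$$

By von Neumann's trace theorem [bit ],

$$ \mathrm{tr}(ZY) \leq \varepsilon(Z)^T \varepsilon(Y). \tag{38} $$

This upper bound can be achieved by choosing $Y$ such that

$$ |e(Y)_j\rangle := |e(Z)_j\rangle, \quad \forall j. $$

Consequently, going back to Eq. (37),

$$
\sup_{\substack{Y = Y^T, \\ \|Y\| \leq 1}} \Omega
= \max\{ \varepsilon(Z)^T \varepsilon(Y) \} \quad
\text{subject to } |\varepsilon(Y)_j| \leq 1,\ \forall j. \tag{39}
$$

Since componentwise $0 \leq \varepsilon(Z) \leq 1$, $\varepsilon(Y) = (1, \dots, 1)^T$ is the
optimal choice. It follows that

$$
\sup_{\substack{Y = Y^T, \\ \|Y\| \leq 1}} \Omega = \sum_{j=1}^{n} \varepsilon(Z)_j = \mathrm{tr}(Z). \tag{40}
$$

This concludes the discussion of $\|Y\| \leq 1$. Assume $\|Y\| > 1$. Note that
$\mathrm{rank}^*(Y)$ is independent of our choice of the $Y$-eigenvectors $|e(Y)_j\rangle$.
Hence, in Eq. (34), we choose

$$ |e(Y)_j\rangle := |e(Z)_j\rangle, \quad \forall j, $$

as before to reach the von Neumann upper bound in Eq. (38). Thus,

$$
\sup_{\substack{Y = Y^T, \\ \|Y\| > 1}} \Omega
= \sup_{\varepsilon(Y)_1 > 1} \{ \varepsilon(Z)^T \varepsilon(Y) - \mathrm{rank}^*(Y) \} \tag{41}
$$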



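leading to

$$
\sup_{\substack{Y = Y^T, \\ \|Y\| > 1}} \Omega
= \sup_{\varepsilon(Y)_1 > 1} \Big\{ \sum_{j=1}^{n} \varepsilon(Z)_j \varepsilon(Y)_j
  - \sum_{j=1}^{s} (\varepsilon(Y)_j - 1) \Big\}. \tag{42}
$$

Here, $s$ is chosen such that $\varepsilon(Y)_j > 1$ for $j \leq s$ and
$\varepsilon(Y)_j \leq 1$ for $j > s$. As in [§], we continue by the addition and the
subtraction of $\sum_{j=1}^{n} \varepsilon(Z)_j$:

$$
\begin{aligned}
\sup_{\substack{Y = Y^T, \\ \|Y\| > 1}} \Omega
&= \sup_{\varepsilon(Y)_1 > 1} \Big\{ \sum_{j=1}^{n} \varepsilon(Z)_j \varepsilon(Y)_j
   - \sum_{j=1}^{s} (\varepsilon(Y)_j - 1)
   - \sum_{j=1}^{n} \varepsilon(Z)_j + \sum_{j=1}^{n} \varepsilon(Z)_j \Big\} \\
&= \sup_{\varepsilon(Y)_1 > 1} \Big\{ \sum_{j=1}^{s} (\varepsilon(Y)_j - 1)(\varepsilon(Z)_j - 1)
   + \sum_{j=s+1}^{n} (\varepsilon(Y)_j - 1)\, \varepsilon(Z)_j
   + \sum_{j=1}^{n} \varepsilon(Z)_j \Big\}.
\end{aligned}
\tag{43}
$$

In this last expression, the first sum is non-positive because $\|Z\| \leq 1$, and the second
sum is non-positive because by definition of $s$, $\varepsilon(Y)_j \leq 1$ for all $j > s$.
Therefore,

$$
\sup_{\substack{Y = Y^T, \\ \|Y\| > 1}} \Omega \leq \sum_{j=1}^{n} \varepsilon(Z)_j = \mathrm{tr}(Z). \tag{44}
$$

Hence, using $Y$ with $\|Y\| > 1$ brings no advantage (compare Eq. (40) and Eq. (44)). Going
back to Eq. (36), we conclude

$$ \mathrm{rank}^{**}(Z) = \mathrm{tr}(Z), \tag{45} $$

i.e., the convex envelope of the matrix rank function over the set
$S^+ \cap B_{\|\cdot\| \leq 1}$ is the matrix trace.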
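Two necessary properties of this envelope are simple to test numerically: on positive semidefinite matrices with operator norm at most one the trace never exceeds the rank, and the two agree on orthogonal projectors. The sketch below checks both on random samples; it does not prove that the trace is the largest convex lower bound. The sampling scheme and sizes are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(4)
n = 5

# rank(Z) - tr(Z) for random PSD matrices Z with operator norm exactly 1.
gaps = []
for r in range(1, n + 1):
    A = rng.normal(size=(n, r))
    Z = A @ A.T
    Z /= np.linalg.norm(Z, 2)
    gaps.append(int(np.linalg.matrix_rank(Z)) - float(np.trace(Z)))

# Equality on an orthogonal projector of rank 3.
q, _ = np.linalg.qr(rng.normal(size=(n, 3)))   # orthonormal columns
Pi = q @ q.T
projector_trace = float(np.trace(Pi))
projector_rank = int(np.linalg.matrix_rank(Pi))
```

Every entry of `gaps` is non-negative, and the gap closes only when all non-zero eigenvalues equal one, as for `Pi`.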



CONSISTENCY TEST 

Even though closely related, the problems (8) and (16) are not identical. There exists no
guarantee that the global optimum of the semidefinite program (16) and the global optimum of
the rank minimization (8) agree. In fact, in explicit computations the convex relaxation (16)
sometimes fails to find the solution to the original rank minimization problem. We suggest
running the consistency test, Algorithm 1, on the computed solution of the convex
relaxation (16). We interpret the failure of the consistency test as the failure of the trace
minimization (16) to find the optimum of the rank minimization (8).



Algorithm 1 Procedure including consistency test
Require: Data $\mathcal{D}$.
1: Run the optimization (16).
2: $r \leftarrow \mathrm{rank}(\mathcal{D})$, $r_{\rm st} \leftarrow \mathrm{rank}(G_{\rm st})$, $r_{\rm m} \leftarrow \mathrm{rank}(G_{\rm m})$.
3: while $(r \neq r_{\rm st}) \lor (r \neq r_{\rm m})$ do
4:     Get more data,
5:     or: abort the analysis to reconfigure the experiment.
6:     Run the optimization (16).
7: end while



The purpose of the remainder of this section is the justification for Algorithm 1. Define

$$
P_{\rm st} = (\vec{\rho}_1 \,|\, \cdots \,|\, \vec{\rho}_W), \qquad
P_{\rm m} = (\vec{E}_{11} \,|\, \cdots \,|\, \vec{E}_{VK}), \tag{46}
$$

$P_{\rm st} \in \mathbb{R}^{d^2 \times W}$, $P_{\rm m} \in \mathbb{R}^{d^2 \times VK}$, so that
$P = (P_{\rm st} \,|\, P_{\rm m})$ (cf. Eq. (3)). We can only hope to successfully reconstruct
the states and the measurements if

$$ \mathrm{rank}(P_{\rm st}) = d^2 = \mathrm{rank}(P_{\rm m}). \tag{47} $$

To understand why Eq. (47) needs to be satisfied, assume that
$\mathrm{rank}(P_{\rm st}) > \mathrm{rank}(P_{\rm m})$. Then, even if we knew all the POVM
elements, we could not reconstruct the states because the POVM elements cannot form a basis in
the space carrying the states. On the other hand, we would not be able to reconstruct the POVM
elements correctly in case of $\mathrm{rank}(P_{\rm st}) < \mathrm{rank}(P_{\rm m})$, even if we
knew all the states. Next, we show that Eq. (47) implies

$$
\mathrm{rank}(G_{\rm st}) = \mathrm{rank}(\mathcal{D}), \qquad
\mathrm{rank}(G_{\rm m}) = \mathrm{rank}(\mathcal{D}). \tag{48}
$$

This forms the justification for the consistency test in Algorithm 1 because a violation of
Eq. (48) implies that the necessary condition (47) cannot hold true. To prove the
assertion (48), we recall the following two inequalities about the rank of matrices
$A \in \mathbb{R}^{n \times k}$, $B \in \mathbb{R}^{n \times m}$:

$$
\mathrm{rank}(A) + \mathrm{rank}(B) - n \leq \mathrm{rank}(A^T B)
\leq \min\{ \mathrm{rank}(A), \mathrm{rank}(B) \}. \tag{49}
$$

The first relation is sometimes called Sylvester's rank inequality. For $A = P_{\rm st}$ and
$B = P_{\rm m}$ and postulating that Eq. (47) holds true, it follows that

$$ 2d^2 - d^2 \leq \mathrm{rank}(P_{\rm st}^T P_{\rm m}) \leq d^2, \tag{50} $$

so that $\mathrm{rank}(\mathcal{D}) = d^2$. Hence,

$$
\begin{aligned}
\mathrm{rank}(G_{\rm st}) &= \mathrm{rank}(P_{\rm st}) = d^2 = \mathrm{rank}(\mathcal{D}), \\
\mathrm{rank}(G_{\rm m}) &= \mathrm{rank}(P_{\rm m}) = d^2 = \mathrm{rank}(\mathcal{D}),
\end{aligned}
\tag{51}
$$

as claimed in Eq. (48).
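The rank argument of Eqs. (49) to (51) depends only on linear algebra, so it can be illustrated with generic real matrices standing in for $P_{\rm st}$ and $P_{\rm m}$. In the sketch below the matrices are Gaussian random matrices, not vectors derived from actual density matrices or POVM elements; this substitution, the sizes and the variable names are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(6)
n, W, VK = 4, 6, 10                      # n plays the role of d**2 with d = 2

P_st = rng.normal(size=(n, W))           # generic stand-in, full row rank n
P_m = rng.normal(size=(n, VK))           # generic stand-in, full row rank n
D = P_st.T @ P_m                         # off-diagonal block of the Gram matrix

rank = np.linalg.matrix_rank
sylvester_lower = rank(P_st) + rank(P_m) - n
sylvester_upper = min(rank(P_st), rank(P_m))

# Deficient case: the "states" span only a 3-dimensional subspace.
P_st_def = P_st.copy()
P_st_def[-1, :] = 0.0
D_def = P_st_def.T @ P_m
G_m_rank = rank(P_m.T @ P_m)             # stays at n, so it differs from rank(D_def)
```

When both stand-ins have full rank, the two bounds coincide and force $\mathrm{rank}(\mathcal{D}) = n$. In the deficient case $\mathrm{rank}(\mathcal{D})$ drops below $\mathrm{rank}(G_{\rm m})$, which is the situation the consistency test is meant to detect.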



NON-UNIQUENESS 

The following construction shows that the data table $\mathcal{D}$ does not uniquely determine
the states and the POVM elements unless both the set of states $\{\rho_w\}_w$ and the set of
POVM elements $\{E_{vk}\}_{vk}$ contain rank-deficient matrices, i.e., matrices sitting on the
boundary of the cone of positive semidefinite matrices $S^+$. The construction is independent
of the total number of states, the total number of measurements, and the dimension of the
underlying Hilbert space. We begin with the expansion of all the matrices corresponding to
states and POVM elements with respect to a generalized Bloch basis. More precisely, we
decompose $\mathrm{Herm}(d)$, the space of Hermitian matrices in $\mathbb{C}^{d \times d}$, as
follows:

$$ \mathrm{Herm}(d) = \mathbb{R}\,\mathbb{I} \oplus T $$

where $T$ is the subspace carrying all traceless matrices and $\mathbb{I}$ denotes the identity
matrix. In $T$ we choose an arbitrary orthonormal basis $\{\sigma_j\}_{j=1}^{d^2-1}$. We denote
the expansion coefficients associated with the full basis as follows:

$$
\begin{aligned}
\lambda_{w,j} &= \mathrm{tr}(\sigma_j \rho_w), \\
\mu_{vk,n} &= \mathrm{tr}(\sigma_n E_{vk}), \\
\alpha_w &= \mathrm{tr}\big( (\mathbb{I}/\sqrt{d})\, \rho_w \big) = 1/\sqrt{d}, \\
\beta_{vk} &= \mathrm{tr}\big( (\mathbb{I}/\sqrt{d})\, E_{vk} \big).
\end{aligned}
\tag{52}
$$

Thus, expanding with respect to the orthonormal basis $\{\mathbb{I}/\sqrt{d}, (\sigma_j)_j\}$
and using that the basis elements $\sigma_j$ are traceless, we get

$$
\mathrm{tr}(\rho_w E_{vk})
= \frac{1}{\sqrt{d}} \beta_{vk} + \sum_{j,n} \lambda_{w,j}\, \mu_{vk,n}\, \mathrm{tr}(\sigma_j \sigma_n)
= \frac{1}{\sqrt{d}} \beta_{vk} + \vec{\lambda}_w^{\,T} \vec{\mu}_{vk}. \tag{53}
$$

Next we observe that for every $\xi \in \mathbb{R}^{d^2-1}$ with $\xi_j \neq 0$ for all $j$,
the transformation

$$ (\lambda_{w,j}, \mu_{vk,j}) \mapsto (\xi_j \lambda_{w,j},\ \xi_j^{-1} \mu_{vk,j}) \tag{54} $$

of the states and POVM elements leaves the associated data $\mathcal{D}$ unchanged and
preserves the constraints $\mathrm{tr}(\rho_w) = 1$ and $\sum_k E_{vk} = \mathbb{I}$. The values
$\xi$ can take are limited by the constraint that the transformed states and POVM elements must
be positive semidefinite matrices. Consequently, the $\xi$-scaling leads to a continuous
manifold of states-measurement configurations that is $\mathcal{D}$-compatible as long as none
of the involved matrices sits on the boundary of the cone of positive semidefinite matrices.

However, if some POVM elements and some states are elements of the boundary of the cone of
positive semidefinite matrices, then the $\xi$-scaling can no longer fulfill the constraint of
mapping positive semidefinite matrices onto positive semidefinite matrices. Hence, the
considered counterexample does not cover cases involving states and measurements which are
rank-deficient. Rather, it seems that the closer the measurements and the states are to being
projective and pure, respectively, the smaller is the set of Gram matrices which are
$\mathcal{D}$-compatible.
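The scaling argument can be made concrete for a single qubit. The sketch below builds one full-rank state and one full-rank two-outcome POVM from Bloch-type coefficients, rescales the traceless coefficients componentwise (states multiplied by $\xi_j$, POVM elements divided by $\xi_j$) and confirms that the outcome probabilities are unchanged while the operators themselves differ. The coefficient values, the scaling vector `xi` and the helper `from_coeffs` are hypothetical choices made for this illustration.

```python
import numpy as np

# Traceless Hilbert-Schmidt orthonormal basis for d = 2: Pauli matrices / sqrt(2).
SIGMA = [np.array(m, dtype=complex) / np.sqrt(2) for m in (
    [[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]])]
I2 = np.eye(2, dtype=complex)

def from_coeffs(c0, c):
    """Hermitian matrix c0 * I/sqrt(2) + sum_j c[j] * sigma_j."""
    return c0 * I2 / np.sqrt(2) + sum(cj * s for cj, s in zip(c, SIGMA))

lam = np.array([0.2, -0.1, 0.3])     # traceless coefficients of a full-rank state
mu = np.array([0.1, 0.25, -0.2])     # traceless coefficients of the POVM element E_1
xi = np.array([1.2, 0.8, 1.1])       # componentwise scaling, all entries non-zero

alpha = 1 / np.sqrt(2)               # fixed by tr(rho) = 1
beta = 1 / np.sqrt(2)                # assumed value, so that tr(E_1) = 1

rho, E1 = from_coeffs(alpha, lam), from_coeffs(beta, mu)
rho_s, E1_s = from_coeffs(alpha, xi * lam), from_coeffs(beta, mu / xi)
E2, E2_s = I2 - E1, I2 - E1_s        # second outcome completes the POVM

probs = [np.trace(rho @ E).real for E in (E1, E2)]
probs_s = [np.trace(rho_s @ E).real for E in (E1_s, E2_s)]
min_eigs = [np.linalg.eigvalsh(M).min() for M in (rho, E1, E2, rho_s, E1_s, E2_s)]
```

All six operators remain strictly positive for this mild scaling, so both configurations are valid quantum models producing the same data.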

NUMERICAL EXAMPLES 

Having discussed how to relax the rank minimization problem, we are now going to compute some
specific examples by running Algorithm 1. As discussed in the previous section, the measurement
data $\mathcal{D}$ generically does not determine the states-measurement Gram matrix uniquely.
However, it is a necessary condition for the uniqueness of the $\mathcal{D}$-compatible states
and measurements that the states-measurement Gram matrix is uniquely determined by
$\mathcal{D}$. Here, uniqueness of the states and the POVM elements is always understood modulo
the fundamental ambiguity described by the simultaneous rotation of all states and POVM elements
via the conjugation with unitary or anti-unitary matrices. Therefore, when demanding uniqueness,
we have to fix entries of $G$ that are not yet fixed by the data table $\mathcal{D}$.
Determining whether or not a specific pattern of a priori known $G$-entries suffices to
guarantee the uniqueness of the estimated Gram matrix is an open question which will be analyzed
in an upcoming paper.

Let $\Omega$ denote the index set marking the a priori known entries of $G$. We are going to
run Algorithm 1 for different Hilbert space dimensions $d$ and for different a priori knowledge
$\Omega$. To start the procedure, we require $G_\Omega$ such that $G$ is uniquely determined by
$G_\Omega$, at least in case of $\mathrm{rank}(G) = \mathrm{rank}(\mathcal{D})$. To sample
$G_\Omega$, we need to sample explicit density matrices and POVM elements. We proceeded by
choosing pure states from the Haar measure and rotating given projective, non-degenerate
measurements according to the Haar measure. Even though pure states and projective measurements
play a distinguished role in quantum mechanics, we do not expect that the quality of the results
differs significantly from choices including non-pure states and non-projective measurements (at
least as long as the states and the measurements are not close to being parallel) because
Algorithm 1 solves a geometric problem in $\mathbb{R}^{d^2}$ and does not probe the quantum
nature of the matrices associated with the vectors in $\mathbb{R}^{d^2}$.

We will observe that the estimations $G$ usually fail the consistency test whenever $\Omega$,
$d$, $W$, $V$ and $K$ are such that $G$ is not overdetermined by $G_\Omega$. In these cases,
more states need to be prepared and additional measurements need to be performed (cf.
Algorithm 1). To conduct the consistency test, we need to compare the ranks of matrices. In
principle, the rank of a matrix is equal to the number of its non-zero singular values. However,
due to small numerical fluctuations in the solutions $G$, this definition is too strict. Rather,
one should tolerate small variations by setting to zero singular values that are very small. In
the consistency test we need to compare the ranks of $G_{\rm st}$ and $G_{\rm m}$ with the rank
of $\mathcal{D}$. We proceed by defining a threshold $\tau := 10^{-4}$ and

$$ \vec{s} := (s_j)_{j \geq \mathrm{rank}\,\mathcal{D} + 1} \tag{55} $$

with $s_j$ denoting the singular values of $G_{\rm st}$ (sorted descendingly). Then, we choose
the following criterion to decide whether or not the ranks of $G_{\rm st}$ and $\mathcal{D}$
agree:

$$
\mathrm{rank}(G_{\rm st}) \approx_\tau \mathrm{rank}(\mathcal{D})
\quad :\Leftrightarrow \quad \|\vec{s}\|_2 < \tau, \tag{56}
$$

and analogously for $G_{\rm m}$. Our results are summarized in Table I. The leftmost column
describes the considered scenario; the number is equal to the Hilbert space dimension, the
letter 'A' refers to 'the diagonal of $G$ is known' and the letter 'B' refers to 'measurements
are projective and non-degenerate'. The second and the third columns list the number of
successful and failed reconstructions of the full Gram matrix $G$, respectively. Here,

$$
\text{'failure'} \quad :\Leftrightarrow \quad
\max_{ij} \big| G_{ij} - G^{(\mathrm{corr})}_{ij} \big| > 10^{-3} \tag{57}
$$

with $G$ and $G^{(\mathrm{corr})}$ denoting the estimated and the correct Gram matrix
respectively. 'Start point' refers to the number of states and measurements we start
Algorithm 1 with. These start points are always chosen such that $G_\Omega$ determines $G$
uniquely. If the trace minimization (16) fails the consistency test, we alternatingly increase
the number of states and measurements by 1 and re-run the trace minimization (16), cf.
Algorithm 1. To do the actual computation, we used SeDuMi [10] for the smaller problems and
TFOCS [11] for the larger problems.
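The thresholded rank comparison of Eqs. (55) and (56) translates directly into a few lines of code. The sketch below applies it to both diagonal blocks of a Gram matrix; the function names, the tolerance used to determine $\mathrm{rank}(\mathcal{D})$ and the Gaussian test data (generic real vectors rather than genuine quantum states and POVM elements) are assumptions for illustration.

```python
import numpy as np

def numerical_rank(M, tol=1e-9):
    """Number of singular values above an absolute tolerance (an assumed threshold)."""
    return int(np.sum(np.linalg.svd(M, compute_uv=False) > tol))

def ranks_agree(block, D, tau=1e-4):
    """Criterion (56): the singular values of `block` beyond rank(D) have 2-norm below tau."""
    r = numerical_rank(D)
    s = np.linalg.svd(block, compute_uv=False)   # sorted descendingly
    return bool(np.linalg.norm(s[r:]) < tau)

def consistency_test(G, W, tau=1e-4):
    """Compare the ranks of G_st and G_m with the rank of the off-diagonal block D."""
    D = G[:W, W:]
    return ranks_agree(G[:W, :W], D, tau) and ranks_agree(G[W:, W:], D, tau)

rng = np.random.default_rng(5)
W, VK, n = 5, 8, 4                               # n plays the role of d**2
P = rng.normal(size=(n, W + VK))                 # generic stand-in for the columns of P
G_good = P.T @ P                                 # consistent Gram matrix of rank n

G_bad = G_good.copy()
G_bad[:W, :W] += 0.1 * np.eye(W)                 # inflates the rank of the state block

passes_good = consistency_test(G_good, W)
passes_bad = consistency_test(G_bad, W)
```

Here `G_good` passes the test, whereas the perturbed state block of `G_bad` has a fifth singular value of about 0.1, far above the threshold, and fails it.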

TABLE I: Numerical experiments

scenario   successes   failures   start point   solver
2A         1000                   (5,5)         SeDuMi [10]
2B         2383                   (5,5)         SeDuMi [10]
3A         1367                   (3,30)        SeDuMi [10]
3B         1000                   (60,100)      TFOCS [11]
4A         1040                   (30,120)      TFOCS [11]
4B         1000                   (65,130)      TFOCS [11]



MISSING CHARACTERIZATION OF Qqm 

To compute the Gram matrix, we have to run the opti- 
mization problem (fl~6|) — including our knowledge about 
the Gram matrix as constraints. Afterwards, we com- 
pute the rank of G and check via a method described 
in (lij whether or not G is uniquely determined by Go. 



Here, the only thing we are sweeping under the carpet 
is our lack of knowledge about whether or not G £ Gqm 
(cf. Eq. ©). As long as we are missing a precise char- 
acterization of Gqm or the dimension of the underlying 
Hilbert space, we do not know how to guarantee easily 
that the simplest quantum Gram matrix compatible with 
Go does not have higher rank than the computed matrix 
G. This issue will be discussed more thoroughly in [l5| . 
The only procedure we could come up with goes as follows:
to settle the question whether or not G ∈ Gqm,
we try to find explicit density matrices and POVM ele- 
ments reproducing G. A heuristic method (taking G as 
input) to compute explicit density matrices and POVM 
elements on the basis of a Gram matrix will be presented 
elsewhere. 



CONCLUSIONS 



We started off guided by Occam's razor: we intended 
to find the simplest quantum model describing the data 
table D (cf. Eq. (1)). Thus, among all D-compatible
quantum models, we search for those quantum models 
whose Hilbert space dimension d is minimal. The rank of
the state-measurement-carrying matrix P (cf. Eq. (5))
provides a lower bound on d^2. Choosing the simplest
quantum model therefore amounts to choosing P compatible
with the data such that rank(P) is minimal. Up to
simultaneous rotations of the columns of P, the matrix P
and the Gram matrix G = P^T P determine each other uniquely.
Hence, choosing the simplest quantum model amounts to
choosing G compatible with the data such that
rank(G) = rank(P) is minimal. We described how this
difficult task can be relaxed to the closely related convex
optimization problem in Eq. (16), which can be solved by
standard methods. If explicit expressions for the density
matrices and POVM elements are required, the computed Gram
matrix G is needed as input to a method, to be described in
an upcoming paper, that computes these realizations. After
having discussed the convex relaxation of Gram matrix
searches, we addressed the question whether or not these
Gram matrices are uniquely determined by the data table D.
A simple counterexample showed that this is not the case if
all the involved states and POVM elements are full-rank.
Consequently, to uniquely fix the state-measurement Gram
matrix, one requires additional knowledge about the entries
of G that are not contained in D. Determining which index
subsets of G lead to unique estimates of G will be part
of [15].
Finally, we computed a few thousand explicit numerical 
examples to confirm that the solution of the rank min- 
imization problem (jSJ) can be solved successfully via its 
convex relaxation (16).
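
Two of the statements above lend themselves to a small numerical illustration: the Gram matrix G = P^T P is unchanged when all columns of P are rotated simultaneously, and a rank r implies d^2 >= r for the Hilbert space dimension d. The following Python sketch uses a toy 2x3 matrix P and a planar rotation; all names (`gram`, `min_hilbert_dimension`, the matrix `O`) are hypothetical and the numbers are not taken from the paper.

```python
import math


def transpose(M):
    return [list(col) for col in zip(*M)]


def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]


def gram(P):
    """Gram matrix G = P^T P of the columns of P."""
    return matmul(transpose(P), P)


def min_hilbert_dimension(rank):
    """Smallest integer d with d**2 >= rank."""
    d = math.isqrt(rank)
    return d if d * d == rank else d + 1


# Toy matrix whose three columns play the role of states/POVM elements.
P = [[1.0, 0.0, 1.0],
     [0.0, 1.0, 1.0]]

# Rotation acting simultaneously on all columns of P.
theta = 0.3
O = [[math.cos(theta), -math.sin(theta)],
     [math.sin(theta), math.cos(theta)]]

G = gram(P)
G_rot = gram(matmul(O, P))
deviation = max(abs(a - b)
                for row_a, row_b in zip(G, G_rot)
                for a, b in zip(row_a, row_b))
print(deviation < 1e-12)          # True: G is rotation invariant

print(min_hilbert_dimension(4))   # 2
print(min_hilbert_dimension(5))   # 3
```

The invariance is just (OP)^T (OP) = P^T O^T O P = P^T P for orthogonal O, which is why G fixes P only up to such rotations.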






ACKNOWLEDGEMENTS 

I wish to express my gratitude to Johan Åberg for his
most extensive and kind support throughout this project.
Moreover, I would like to thank Matthias Christandl,
Dejan Dukaric, David Gross, Philippe Faist, Patrick
Pletscher, Sharon Wulff, Renato Renner, Lídia del Rio,
and Mario Ziman for discussions that helped me to shape
this note. I acknowledge support from the Swiss National
Science Foundation through the National Centre of
Competence in Research "Quantum Science and Technology".



[1] J. M. Chow, J. M. Gambetta, A. D. Córcoles, S. T.
Merkel, J. A. Smolin, C. Rigetti, S. Poletto, G. A. Keefe,
M. B. Rothwell, J. R. Rozen, M. B. Ketchen, and M. Steffen.
Phys. Rev. Lett., 109:060501, 2012.

[2] C. Stark. Simultaneous estimation of dimension, states 
and measurements: Computation of representative den- 
sity matrices and POVMs. In Preparation. 

[3] T. Moroder, M. Kleinmann, P. Schindler, T. Monz,
O. Gühne, and R. Blatt. arXiv:1204.3644, 2012.

[4] S.T. Flammia, D. Gross, Y.-K. Liu, S. Becker, and
J. Eisert. arXiv:1205.2300, 2012.

[5] I. Gerhardt, Q. Liu, A. Lamas-Linares, J. Skaar, 
V. Scarani, V. Makarov, and C. Kurtsiefer. Phys. Rev. 
Lett., 107:170404, 2011. 

[6] A. Acín, N. Gisin, and L. Masanes. Phys. Rev. Lett.,
97:120405, 2006.

[7] D. Rosset, R. Ferretti-Schöbitz, J.-D. Bancal, N. Gisin,
and Y.-C. Liang. arXiv:1203.0911, 2012.

[8] M. Dall'Arno, E. Passaro, R. Gallego, and A. Acín.
arXiv:1207.2574, 2012.

[9] M. Fazel, H. Hindi, and S. Boyd. Proceedings of the
American Control Conference, 2001.

[10] J. Sturm. Optimization Methods and Software,
11-12:625, 1999.

[11] S. Becker, E.J. Candès, and M. Grant. Technical report,
Department of Statistics, Stanford University (Preprint
available at http://tfocs.stanford.edu/tfocs/paper.shtml),
2010.

[12] J.-B. Hiriart-Urruty and C. Lemaréchal. Convex Analysis
and Minimization Algorithms II. Springer, 1993.

[13] R. Bhatia. Matrix Analysis. Springer, 1997.

[14] R.A. Horn and C.R. Johnson. Topics in Matrix Analysis.
Cambridge University Press, 2006.

[15] C. Stark. Simultaneous estimation of dimension, states 
and measurements: Rigidity considerations. In Prepara- 
tion. 

[16] B. Recht, M. Fazel, and P.A. Parrilo. SIAM Review,
52(3):471, 2010.

[17] Order 10° measurement repetitions for 2-dimensional 
systems (experiments involving superconducting qubits). 
In these regimes, statistical fluctuations due to finitely- 
many measurement repetitions play a negligible role. 

[18] The operation 'argmin' outputs the optimal decision
variable, i.e., a Gram matrix that realizes the minimal
rank.

[19] Furthermore, rank minimization is even NP hard [3. 



